Accepted to The Astrophysical Journal 

Preprint typeset using 1^1^^ style cniulatcapj v. 03/07/07 



00 

o 

O 
(N 



6 



> 

en 
m 
vn 

o 

00 
O 



X 



EAZY: A FAST, PUBLIC PHOTOMETRIC REDSHIFT CODE 
Gabriel B. Brammer\ Pieter G. van Dokkum\ and Paolo Coppi^ 

Accepted to The Astrophysical Journal 

ABSTRACT 

We describe a new program for determining photometric redshifts, dubbed EAZY. The program is 
optimized for cases where spectroscopic redshifts are not available, or only available for a biased subset 
of the galaxies. The code combines features from various existing codes: it can fit linear combinations 
of templates, it includes optional flux- and redshift-based priors, and its user interface is modeled on 
the popular HYPERZ code. A novel feature is that the default template set, as well as the default 
functional forms of the priors, are not based on (usually highly biased) spectroscopic samples, but on 
semi-analytical models. Furthermore, template mismatch is addressed by a novel rest-frame template 
error function. This function gives different wavelength regions different weights, and ensures that 
the formal redshift uncertainties are realistic. We introduce a redshift quality parameter, Qz, that 
provides a robust estimate of the reliability of the photometric redshift estimate. Despite the fact 
that EAZY is not "trained" on spectroscopic samples, the code (with default parameters) performs 
very well on existing public datasets. For iiT-selected samples in CDF-South and other deep fields we 
find a la scatter in Az/(1 -I- z) of 0.034, and we provide updated photometric redshift catalogs for the 
FIRES, MUSYC, and FIREWORKS surveys. 
Subject headings: cosmology: observations — galaxies: evolution — galaxies: formation 



1. INTRODUCTION 

Accurate redshifts of distant galaxies are crucial for 
nearly all of observational cosmology. Whereas exten- 
sive spectroscopy with multi-object spectrographs on 
8- 10m class telescopes has yielded redshifts for thou- 
s an ds, and in some cases tens of thousands, of galaxies 



e ^Steidel et al.ll200a iDavis et al.ll200a iLe Fevre et"an 



I2005D . these galaxies tend to be relatively bright at op- 
tical wavelengths. For galaxies fainter than _R ~ 25 we 
rely almost exclusively on photometric redshifts, derived 
from fitting temp late spectra to broad - or mediuni-band 
photometry (e.g. Lanzetta et al. 1996; 'Wolf et al.l 120031: 
Pranx et al 2003; Mobashcr et al. 2004; Dror v et all 
2005[) . This situation is not likely to change, even 



with the advent of efficien t spectrographs wi th very 
wide fields (such as WFMOS: [Bassett et al.ll2005[ ). muhi- 
object capabilities in the near-infrared (e.g. MOIRCS; 
IJchikawa et al. 2006), or larger telescopes. The signal- 
to-noise ratio (S/N) per resolution element in the contin- 
uum decreases with spectral resolution as S/N oc R~^-^ 
for a given exposure time. Therefore, the required in- 
tegration time to maintain a given S/N per resolution 
element increases linearly with the spectral resolution, 
quite independent of the details of the telescope and in- 
struments. As a typical set of broad band filters corre- 
sponds to i? ~ 5 and typical faint object spectrographs 
have R ~ 1000, spectroscopy is about two orders of mag- 
nitude more time consuming than photometry for a given 
telescope size. A notable exception is spectroscopy of 
emission line objects, which can be extremely efficient at 
faint magnitudes. 

The methodology for determining photometric red- 
shifts using the template-fitting approach is essentially 
straightforward: the photometric data are compared to 
synthetic photometry for a large range of template spec- 

^ Department of Astronomy, Yale University, New Haven, CT 
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tra and redshifts, and the most likely redshift follows 
from a statistical analysis of the differences between 
observed and synthetic data. Several codes exist that 
perform this task, each employing its own techniques 
for creating the synthetic photometry and interpreting 
the residuals in the redshift - template plane. Popu - 
lar exam ples include HYPE RZ (jBolzonella et al.ll2000f) . 
ImpZ (B abbedge eraI]|2004D . and Le PHARE^ (Arnouts 
& llbert), which do a straightforward x^ minimization; 
GREGZ^ (Rudnick et al. 2001, 2003), which allows fin- 
ear combinations of templates and uses Monte Carlo 
methods to d etermine the redshift uncertainties; BPZ 
(|Benitedl2000[ ). which uses Baye sian statistics al l owing 
the use of priors; and ZEBRA (Fcldmann e t all [20061) 
andkcorrect (Bla nton fc Roweis 2007, hereafter BR07), 
which include (distinct) iterative template-optimization 
routines that make use of the exten sive spectroscopic 
databases of the zCOSMOS (iLillv et a l. 2007) and Sloane 
Digital Sky Survey fSDSS: f^rk et al.ii200Q) projects, 
respectively. 

For obvious reasons photometric redshifts benefit from 
having high quality photometry in many bandpasses and 
from sampling strong continuum features in the observed 
wavelength region (such as a Lyman or Balmer break), 
irrespective of the methodology. However, given a set of 
objects with good quality photometry, the aspect that 
is of paramount importance for obtaining reliable photo- 
metric redshifts is the select ion of the template set (see 
Feldmann et al. 2006, ^O]) . iFeldmann et al.l (|2006D ob- 
tain very good results by iteratively adapting the tem- 
plates, minimizing the systematic differences between the 
best fitting templates and the actual galaxy photome- 
try. This approach not only reduces the random uncer- 
tainty in the photometric redshifts but can also elim- 

^ http : //www . oamp . f r/people/arnouts/LE_PHARE . html 
^ Greg Rudnick did not name his code; the name GREGZ is 
used for convenience in the present paper. 
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inate systematic effec ts in certain redshift ranges (see 
iFeldmann et al.l[2006f ). The disadvantage of this opti- 
mization is that its effects can only be assessed when a 
large sample of galaxies with spectroscopic redshifts is 
available, and when this sample is a random subset of 
the entire photometric sample. This assumption may be 
valid in the case of zCOSMOS, but this is generally not 
the case in studies of galaxy samples which are signifi- 
cantly fainter than the spectroscopic limit. 

In this paper we describe a new photometric redshift 
code which was written specifically for samples with in- 
complete and/or biased spectroscopic information (such 
as, for example, faint iiT-selected samples). Rather than 
minimizing the scatter in the relation between photo- 
metric and spectroscopic redshift using the spectroscopic 
sample as a training set, a user-defined template er- 
ror function is introduced to account for wavelength- 
dependent template mismatch. The code combines fea- 
tures from various existing programs: the possibility 
of fitting linear combinations of templates (as done in 
GREGZ), the use of priors (as first done in BPZ), and 
a user-friendly interface based on HYPERZ. The default 
template set and the redshift-magnitude priors are de- 
rived from semi-analytical models. These models are, 
of course, only an approximation of reality, but their 
"perfect" completeness down to very faint magnitudes 
outweighs their imperfect representation of the real Uni- 
verse. 

The outline of this paper is as follows. In f}2l we de- 
scribe the implementation of the code, including the op- 
timized template set and redshift priors derived from 
semi-analytical models and the template error function 
derived from the GOODS-CDFS photometric catalog. In 
fj3]we test the code on a combined photometric catalog 
from a variety of deep multi- wavelength surveys and com- 
pare the photometric redshifts to spectroscopic redshifts 
of nearly 2000 galaxies at < z < 4. In f}4] we discuss 
the reliability of the photometric redshift estimates and 
provide cautionary examples for relying solely on spec- 
troscopic samples to estimate the photometric redshift 
quality. Finally, in fj5] we summarize the features and 
performance of the photometric redshift code and dis- 
cuss future avenues for improvement. 

2. IMPLEMENTATION 

2.1. Basic Algorithm 

The basic algorithm is similar to many existing pho- 
tometric redshift codes. The algorithm steps through a 
user-defined grid of redshifts, and at each redshift finds 
the best fitting synthetic template spectrum by minimiz- 
ing 
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with iVfiit the number of filters, Tz.i,j the synthetic flux of 
template i in filter j for redshift z, Fj the observed flux 
in filter j, and SFj the uncertainty in Fj. Templates are 
corrected for abso r ption by intervening H I clouds fol- 
lowing the Ma daul (| 1995 1 prescription. The template fit 
is done in linear space, as this allows a proper treatment 
of flux errors and of negative flux measurements. 
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Fig. 1. — Distribution of K magnitudes of IC* galaxies from 
our simulated lightcone catalog. The _R"-band number counts of 
the two deep fields from the FIRES survey IILabbe et al.i i2003l : 
Forster Schreiber et al. 2006) are shown for comparison, scaled by 
the ratio of the number of galaxies with Ks < 23.5 in these fields 
compared to the lightcone catalog, mset; Redshift distribution of 
the galaxies in the lightcone catalog. The widths of the shaded 
FIRES distributions and the error bars on the histograms are the 
poisso n-like errors for small numbers calculated following Gehreli 
II1986I) . The simulated number counts are generally consistent with 
the observed values to within a factor of ~2. The simulated cat- 
alog provides a unique template calibration set that is complete 
at the limiting magnitudes characteristic of deep imaging surveys, 
and that includes thousands of galaxies at z > 2 where observed 
spectroscopic samples are sparse. 

In most photometric redshift codes (e.g. HY- 
PERZ, BPZ, ZEBRA) each template T, is a single- 
component empirical or synthetic spectral energy distri- 
bution (SED). However, in practice many galaxies are 
not well represented by any individual template from the 
user-supplied library, and as a result template mismatch 
is the primary source of error in photometric redshift 
estimates. In ZEBRA, the detailed form of each tem- 
plate is adapted iteratively based on residuals from fits 
to galaxies with spectroscopic redshifts. Instead, we fol- 
low GREGZ and allow linear combinations of templates. 
Rather than finding the best-fltting template Ti the code 
finds the best- fitting coefficients, a^, in 
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with all ai > 0. The number of template components 
fit simultaneously is one, two, or all of the templates in 
a user-defined list. For the one- and two-template fits 
the coefficients, ai, are determined using analytic least- 
squares fits, while for the latter option the coefficients for 
every template in the libr ary are de t ermin ed iteratively 
following the algorithm of ISha et al.l (|2007D . In practice, 
the choice between a 2-template fit and an A^-template 
fit is a trade-off between accuracy and computation time. 
The improvement going from two templates to N tem- 
plates can be significant if the number of templates in 
the library is small (< 5), but is usually negligible when 
the number of templates is large (> 10). 

2.2. Optimized tem,plate set 
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Fig. 2. — left: Five templates generated following the IBlanton &: RoweisI II2007I ) algorithm with PEGASE models and a calibration 



set of synthetic photometry derived from semi-analytic models. Shown in grey is the additional young and dusty template added to 
compensate for the lack of extrem e ly dus ty galaxies in the SAMs. All of th e templates are shown normalized at 6000A. middle: "Default" 
template set of'Blanton & Rowei^ 1)20071 ) . right: Empirical templates fromlColeman et al.l JlgsO) plus the "SBl" starburst spectrum from 
[Kinncv ct al. (1996) that are frequently used for photomet ric redshifts. T he templates shown are extended into the NUV and NIR with 
IBruzual fc Charloti ([20020 niodels (dashed regions; see, e.g. IRudnick et al.|[2001i) . 



The recent significant improvement in the quahty of 
maximum-likelihood photometric redshift estimates has 
come largely from optimizing the template set that is 
fit to the broad-band photometry. In practice, the tem- 
plate set needs to be large enough that it spans the broad 
range of multi-band galaxy colors and small enough that 
the color and redshift de generacies are kept to a mini- 
mum (e.g. iBenitezI [20001 ). The combined set of em- 
pirical galaxy templ ates from lColeman et all (|1980l ) and 
iKinnev et al.l ()1996f ) (hereafter, CK) is the most fre- 
quently used set for photometric redshift measurements 
(BPZ; ZEBRA; llbert e t al. 2006; Mobasher et al. 2001^, 
and it provides the additional benefit of allowing a rough 
estimate of an individual galaxy's spectral type as well 
as its redshift. The CK set suffers from a number of dis- 
advantages, however: it is determined from local galax- 
ies and is therefore not guaranteed (or expected) to be 
representative of galaxies at high redshift, and the tem- 
plates need to be extended into the UV and NIR for 
use with photometry from modern multi-wavelength sur- 
veys. Several of the recent photometric redshift efforts 
address the first issue by iteratively adjusting the basis 
CK templates based on fi ts to large broad-band photo- 
metr ic datasets (;ZEBRA: IHbertetaLl [20061: lAssef eFaLl 
l2008f ). While this technique is shown to significantly im- 
prove the quality of the redshift estimates, it requires 
a large spectroscopic redshift calibration sample and re- 
mains largely unproven at z > 1.5. 

We derive a new minimal template set based purely on 
stellar population synthesis models that is designed for 
deep optical-NIR broad-band surveys and that requires 
no optimization based on spectroscopic samples. The 
template set is calculated following the novel "nonneg- 
ative matrix factorization" (NMF) algorithm of BR07, 
which essentially takes a large number, A^in, of synthetic 
models and computes a reduced set of A^out basis tem- 
plates that best reproduce a supplied broad-band pho- 
tometric calibration catalog. The A'out basis templates 
are non-negative linear combinations of the A^m models 
and can be considered to be the "principal-component" 
spectral templates of the calibration catalog. For ex- 
ample, BR07 co mpute A^out = 5 bas i s tem plates from a 
hst of Nin = 485 IBruzual fc Charloti (|2003D models that 



efficiently reproduce a large sample of photometric ob- 
servations from the SDSS. 

While the SDSS offers precision photometric and spec- 
troscopic information for a spectacular number of galax- 
ies, it is limited to the nearby universe - which means 
that templates determined from (or optimized by) SDSS 
observations are subject to similar uncertainties when ex- 
tended to higher redshifts as those of the CK templates 
described above. Flux-limited spectroscopic samples are 
now a vailable u p to i? ftj 24.5 ( e .g. D EEP2. lDavis et all 
(|200l : VVDS. iLeFevre et all ([2005l) l. but deep imag- 
ing surveys reach well beyond the limits of these spec- 
troscopic surveys. In order to obtain a calibration sam- 
ple that extends to faint magnitudes and high redshifts, 
we turn to theoretical models of galaxy formation and 
evolution that are complete to the extent that they 
reproduce observed galaxy properties at high redshift. 
Specifically, we obtain synthetic UBVRIzJ HK "photom- 
etry" of galaxies in a 1 deg^ lightcone ([Blaizot et al.l 
120051) cre ated from galaxies in the s emi-analytic model 
(SAM) of 'Dc Lucia fc Bl aizot' (200f), which is based on 
the Millennium Simulation (Springelct al. 2005). Syn- 
thetic S£ectraleiierg;5;_distribu^ns (SEDs) are generated 
from IBruzual fc Chariot (|2003l) models following the 
non-trivial star formation histories of the galaxies in the 
semi-analytic model. While the models do not exactly 
reproduce the rel ative fractions of red and blue ga laxies 
at high redshift ([Marchesini fc van DokkumI l2007f) and 
the simplified t reatment of dust obscurat ion is proba- 
bly inadequate (jKitzbichler fc Whit3l2007t l. the models 
should contain a more representative sample of galaxy 
SEDs over the broad redshift range < z < 4 than purely 
local surveys. For the calibration set of the template op- 
timization routine, we randomly select a subsample of 
10^ galaxies with Kab < 25 from the lightcone catalog. 
The redshift and K magnitude distributions of galaxies 
in the calibration sample are shown in Figure [TJ 

For the input list of A^in models to the BR07 
NMF a lgorithm, we use the library of PEGASE 
models (iFioc fc Rocca-Volmerangj Il997| ) described by 
'Grazian et al.' (2006), which those authors use to obtain 
high-quality photometric redshifts over < z < 2 in the 
GOODS-South field. The library includes A^jn -- 3000 
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Fig. 3. — top panel: Normalized residuals (A_Fj = Tj j — Fj) of redshift /template fits to the full GOODS-CDFS photometric catalog 
HWuvts et al . 2008), shifted into the rest-frame using the photometric redshifts of each source, estimated using the "NMF" templates 
described in i|2.2l Only objects with a measured flux in all bands are shown. The dashed line indicates the median of the residual absolute 
values in wavelength bins with widths such that each bin contains ~2000 data points, bottom panel: Binned residuals (absolute value) 
shown with the vertical scale expanded. The dashed line is the same as in the top panel. The dot-dashed line is the median photometric 
error, cfj/Fj, multiplied by a factor of 0.67 to scale the errors from l-cr (68%) to 50% confidence intervals. We subtract in quadrature the 
median photometric error from the median observed residual in each bin to determine the contribution from "template-error" (filled circles). 
The solid line is our adopted template error function, which has been smoothed and extended into the NUV to avoid a discontinuity at 
the Lyman break. We extend the error function beyond the longest wavelength bin to have cy^; = 1.0 at A = 10 /im to account for the 
presence of dust emission in the NIR that is not included in the templates. 



models with ages between 1 Myr and 20 Gyr and having 
a variety of star formation histories including exponen- 
tially decreasing star formation rates characterized by 
the exponential decay rate, r*, and constant star for- 
mation models that are truncated. Roughly half of the 
models in the library are constant star formation models 
with additional reddening [0 .5 < E(B — V)< 1.1] applied 
using the extinction curve of lCalzetti et al.l (120001 ). which 
are designed to represent young, dusty objects. PEGASE 
models provide a self-consistent tre atment of emission 
lines, which are not included in the iBruzual fc CharlotI 
()2003() models. Though the synthetic photometry in the 
lightcone catalog does not include emission lines, we use 
the PEGASE template set with and without emission 
lines and find slightly better results with the output tem- 
plates when emission lines are included. 

The final A^out = 5 templates computed from the indi- 
vidual PEGASE templates as fit to the lightcone catalog 
are shown in Figure^ along with the BR07 and CK tem- 
plate sets for comparison. The two NMF-derived tem- 
plate sets span a larger range of optical- NIR colors than 
the CK set, and the NMF templates contain more infor- 
mation in the NUV than the simple power-law extrapo- 
lations of the CK set. The higher spectral resolution of 
the pure-model NMF templates is not likely to affect pho- 
tometric redshift estimates, but more realistic colors in 
the rest-UV should be important as optical filters sample 
this portion of the SED at moderate redshift, z >^1. The 
primary difference between the BR07 template set and 
the set derived here is the presence of the dusty (and old) 
template that is the reddest template in the BR07 set. 
Though there are models with significant dust absorp- 



tion in the PEGASE library, the NMF algorithm does 
not require a dusty template to fit the lightcone photo- 
metric catalog. This is likely due to the fact that the 
simple dust prescription in the lightcone model is unable 
to produce any extremely dusty galaxies. Therefore we 
add a dusty starburst model [t = 50 Myr, Ay = 2.75) to 
the set of 5 NMF-derived templates to compensate for 
the lack of dusty galaxies in the SAM calibration sample 
(Figure [2|) . While the parameters chosen for this addi- 
tional template are somewhat arbitrary, they are chosen 
such that the template fills in regions of the rest-frame 
color space not sampled by the 5 NMF-derived templates 
(Figure [2|) . In [J3] we compute photometric redshifts for 
a variety of publicly available optically- and iiT-selected 
photometric survey fields and we compare the quality of 
redshifts estimated using these three template sets. 

2.3. The template error function 

Multi- wavelength surveys frequently sample rest-frame 
wavelengths from the UV to the ncar-IR for galaxies 
at z < 4, and the quality of the calibration of pop- 
ulation synthesis models is not constant over that full 
wavelength range. This can be caused by a number of 
factors, such as (1) uncertainties in the stellar evolution- 
ary tracks; (2) transformation of the physical parameters 
of the models to observable quantities; (3) variations in 
the dust extinction law; and (4) stochastic spectral fea- 
tures that are simply not included in the mode l s. For 
example, relevant to item (1) above, iMarastonI ()2005f ) 
find that short-lived thermally-pulsating asymptotic gi- 
ant branch stars, which had not been previously properly 
included in isochrone synthesis models, can contribute 
significantly to the emergent NIR flux from stellar pop- 
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ulations younger than ^1.5 Gyr. Considering item (4), 
the presence/strength of emission lines depends strongly 
on properties of the ISM, which are only loosely coupled 
to the evolution of stars within a galaxy. For examp le, in 
the extreme case of Lyman-a, ISteidel et alJ |2000) find 
equivalent widths ranging over 1-2 orders of magnitude 
in emission and absorption for a relatively homogeneous 
sample of Lyman-break galaxies at z '^ 3. 

No single generahzed template set could hope to ac- 
count for all of these uncertainties, and it is therefore no 
surprise that corrections to a minimal template set are re- 
quired to optimize pho tometric redshifts for a given pho- 
tometric catalog fe.g. I llbert et al.l[2006[ ). As discussed 
in H2.21 such optimization requires an extensive calibra- 
tion set of galaxies with spectroscopically-measured red- 
shifts, which is often unavailable or at best incomplete 
for deep imaging surveys. Here we derive a "template 
error function" that seeks to incorporate uncertainties 
such as those mentioned above into the template fitting 
algorithm (Eq. [T]). Besides the calibration uncertainties 
described above, any set of individual templates will have 
difficulties reproducing the wide variety of star formation 
histories and dust extinction in galaxies. To make mat- 
ters worse, such properties not only vary among galaxies 
at a given cosmic epoch, but also vary systematically 
with time (or redshift). Along with allowing for multi- 
ple linear combinations of individual templates, the tem- 
plate error function developed here helps to account for 
these variations. The exact form of the template error 
function depends on the chosen set of templates, but it 
is computed in such a way that it generally applicable 
especially when no spectroscopic calibration sample is 
available. 

The template error function is derived in the follow- 
ing way. First, photometric redshifts are determined 
with a uniform tem plate erro r function (set at a constant 
0.05 mag; see, e.g. iRudnick et al.ll200lh . for the photo- 
metric catalog of t he GOODS-CDFS field described by 
IWuvts et all (|2008D . We use the CDFS because it pro- 
vides the deepest survey with extensive multi-wavelength 
coverage that includes the NIR IRAC bands.* Next, we 
calculate the residuals from the best-fitting model spec- 
tra and de-redshift them into the rest-frame. These resid- 
uals are shown in the top panel of Fig. [3] (after several 
iterations) . The binned median absolute values of these 
residuals are shown by the solid symbols in the bottom 
panel of Fig. [3l along with a smoothly varying function 
that is fit to the solid symbols (dashed line). Finally, 
the template error function (solid line) is created by sub- 
tracting in quadrature the (scaled) photometric errors 
(indicated by the dot-dashed line) from this smoothly 
varying function. The procedure is repeated until con- 
vergence is reached. 

The residuals in Figure[3]are only shown for bands with 
signal-to-noise (S/N) > 10. To test whether the derived 
error function depends on the S/N of the flux measure- 
ments, we also compute the template error for different 
limits 3 > S/N > 20. The sizes of the median resid- 

* Since we wish to determine how well the PEGASE template 
set matches observed data, we cannot use the synthetic lightcone 
photometry as that would only illustrate differences between SEDs 
produced by the PEGASE and lBruzual fc Charlol | |2003) popula- 
tion synthesis codes. 




Fig. 4. — Prior probabilities, p{z\mo), a function of observed R 
and K (AB) magnitudes. The shape of the priors is given by the 
model redshift distributions of galaxies in the lightcone catalog, 
normalized to J p{z\Tno dz = 1. 

uals and photometric errors decrease as the S/N limit 
increases, but we find that the quadratic difference of 
the two remains mostly unchanged. The shape of the 
template error function roughly follows what one might 
expect following the considerations enumerated above: 
the template error is lowest in the rest frame optical, 
A =3500 A- 9000 A, where stellar isochrones are well- 
calibrated; the template error is large in the UV where 
dust extinction is strongest and most variable; and the 
template error increases again in t he NIR where the stel- 
lar isochrones are uncertain (e.g. iMarastonI l2005f ) and 
where thermal dust emission and stochastic PAH line 
features begin to appear at A > 3/im. 

2.4. Bayesian Prior 

The template-fitting method of estimating photomet- 
ric redshifts suffers from the fact that template colors are 
frequently degenerate with redshift, such that the red- 
shift probability distributions can have multiple peaks 
over a broad range of redshifts. For example, relatively 
featureless blue SEDs can often be fit equally well at 
z — and z ^ 3 because the templates are unable to dis- 
tinguish blue colors redward of the Balmer and Lyman 
breaks, respectively. The degeneracies can sometimes be 
broken by adding additional photometric bands (in the 
previous example, adding IRAC photometry helps) or by 
incorporating statistical methods to help choose between 
multiple probab ility peaks at different redshifts. 

iBenitezI (|200Q ) was the first to develop a Bayesian ap- 
proach to estimating photometric redshifts that includes 
the use of a Bayesian prior, which adds additional in- 
formation besides the observed photometric c olors to 
help c onstrain the redshift estimates. Following iBenitea 
()2000( l we adopt an apparent magnitude prior, p{z\mo), 
which is the redshift distribution of galaxies with appar- 
ent magnitude, too- This is essentially a "luminosity- 
function-times-volume" prior that assigns a low proba- 
bility to very low redshifts where the volume sampled is 
small and a similarly low probability of finding e xtremely 
bright galaxies at high redshift. In contrast to iBenited 
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TABLE 1 



Field 

HDF-N 

HDF-S 

MS-1054 

MUSYC 

GOODS-CDFS 



Area (arcmin ) 



Bands 



K depth (3ct) 



Ref. 



~5 UBVIJHK 

4.5 UBVIJHK + IRAC 

29 UBVVeo6hi4JHK + IRAC 

400 UBVRIzJHK + IRAC 

138 [UasBV RI]wFilBV I z] ACS JHK + IRAC 



~24 Fernandez-Soto, Lanzetta, & Yahil (1999) 

26.2 Labbe et al. (2003) 

25.5 Forster Schreiber et al. (2006) 

23.5 Quadri et al. (2007), Marchesini et al. (in prep) 

24.9 Wuyts ct al. (2008) Popesso et al. (2008) 



(|200Cll ). we do not include spectral (template) type in the 
prior because (1) our derived templates do not directly 
correspond to individual galaxy spectral types; (2) we 
fit linear combinations of all 5 templates simultaneously; 
and (3) we do not want the prior to impose any color 
restrictions as a function of redshift, the last poi nt being 
most important. For example, the prior used bv iBenitea 
(|2000f ) based on the HDF-N gives essentially zero prob- 
ability to red E/SO spectral types at z > 2, even though 
recent work has shown that such galaxies ar e fairly com- 
mon, at least in NIR-selected samples (e.g. iKriek et al.l 

To determine the shape of the prior probability dis- 
tribution, p(z|too), we again turn to the synthetic pho- 
tometry of the SAM lightcone catalog described in i i2.2l 
because this problem is subject to many of the same com- 
pleteness limitations of observed samples as the template 
optimization routines. In principal, one could iteratively 
determine redshift distributions from observed data and 
then recompute photometric rcdshifts using the distri- 
butions as the prior, but observed samples are generally 
small at high redshift and the iterative method is not 
guaranteed to converge to the truth. Though the syn- 
thetic models do not perfectly reproduce observed data 
(e.g. Figure [T|), they should be able to reasonably esti- 
mate p(z|too) over < z < 4, since in practice it is only 
the shape of p(z|77io) that matters in a given mo bin and 
not the overall normalization of the number of galaxies 
in that bin. We adopt a functional form of the prior. 



p(z|mo,i) oc z''' exp 



z 
zo,i 



(3) 



(jBenited l2000[ ) and fit the parameters, ji and zo,i, for 
the redshift distributions in each magnitude bin, tuoa- 
Because the lightcones contain many galaxies at high 
redshift, the functional fits do not extrapolate at high 
redshift, but rather they ensure that the shape of the 
prior is smooth over the entire redshift range. Figure 2] 
shows p{z\mo) for two selection bands, R and K, deter- 
mined from the full 1 deg^ lightcone catalog of ^ 10^ 
galaxies. 

2.5. Output redshifts and confidence intervals 

With the tabulated values of the prior, we can now 
compute the posterior redshift probability distribution 
for each galaxy, given the galaxy's observed colors, C, 
and apparent magnitude, mo: 

p{z\mo,C) (xp{z\C)p{z\mo), (4) 

(|Benited [2OOOI ) where p(z|C) == exp [-x^ (z) /2] is the 
likelihood computed from the template fits (EqslU 12) 
over a user-supplied redshift grid. Given the posterior 



probabilities, the code produces two redshift estimates, 
and Zm.n, where z„ is simply the redshift where the 



^mpt 



probability is at its maximum and z„ip is the value of 
the redshift marginalized over the posterior probability 
distribution, 



/ z p{z\C,mo)dz 
J p{z\C,mo) dz 

For a gaussian probability distribution. 



^mp- 



(5) 



In 



practice z„p smooths out some small-scale systematic er- 
rors apparent in Zpijot"-Zspcc comparisons and z„p allows 
the use of a coarse redshift grid to speed up the execu- 
tion of the code without significant loss of precision in 
the output redshift estimates. 

We compute formal lower and upper confidence limits, 
zio and Zup, for a confidence level, a, by integrating the 
posterior probability distribution from the edges until the 
integrated probability is equal to a/2: 



p{z\C, mo) dz, 
p(z|C, mo) dz, 



(6) 



where the limits (0, 00) are replaced in practice by 
user-specified parameters, (zmin,zmax), and 1-, 2-, 
and 3-(T confidence limits are computed with a — 
0.317, 0.046, 0.003, respectively 

2.6. Software 

The algorithm is implemented in a public software pro- 
gram, dubbed "EAZY" (for "Easy and Accurate Red- 
shifts from Yale"). The user-interface of EAZY was 
modeled on the popular HYPERZ code, but the under- 
lying code is written independently. EAZY is controlled 
through a parameter file whose defaults should provide 
good redshifts for most applications. We provide the op- 
timized template set, template error function, and priors 
described above as default inputs, but the code accepts 
any user-defined version of these files in simple ASCII 
formats. The code is fast, ta king about four min utes to 
run the 6300 galaxies in the IWuvts et all (|2008l ) FIRE- 
WORKS GOODS-CDFS catalog with linear combina- 
tions of the six default templates ( ij2.2p on an Apple Mac- 
Book running a 2 Ghz Intel Core 2 Duo processor. For a 
redshift test grid, < z < 6; Az = 0.01 (1 + z), EAZY 
requires 12 s compared to 270 s for HYPERZ (without 
fitting reddening) to run the entire CDFS catalog when 
fitting only the single best-fit template of the six (but see 
Figure [6|). The EAZY package, along with installation 
instructions, example files and a user's manual, can be 
obtained from http : //www . astro . yale . edu/eazy/. 
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Fig. 5. — left: Spectroscopic vs. photometric redshifts computed for 5 surveys with deep optical-NIR photometry, shown on a "pseudo- 
log" scale, right, top: Residuals, Az = Zphot — ^^spec, as a function of ^spcc. The red datapoints indicate cr[z) scaled by a factor of two in 
bins that each contain A' = 200 sources and have widths indicated by the horizontal bars. The blue points indicate the median residual in 
the same bins, also scaled by a factor of two to magnify the low-level systematic effects, right, bottom: Same as the top panel, but plotting 
the "surface density" of points rather than the points themselves to highlight systematic effects. 



3. APPLICATION 

We have combined a diverse set of public photometric 
catalogs to test the quality of the photometric redshifts 
computed by EAZY. In this section, we compare photo- 
metric redshifts to a large sample of spectroscopic red- 
shifts. We stress that we do not use the spectroscopic 
sample to calibrate the photometric redshifts explicitly, 
because, as discussed earlier, our spectroscopic sample 
is not necessarily representative of the full photometric 
samples. Rather, we use the spectroscopic sample to 
illustrate how certain systematic effects depend on fea- 
tures of the code implemented as described in ij2l 

3.1. Combined test sample 

To adequately test the code, we require photometric 
catalogs with broad multi-wavelength coverage from the 
UV through the NIR to produce unbiased photometric 
redshifts over a broad redshift range. Deep NIR pho- 
tometry is essential for uncove ring the complete popula - 
tion of galaxies at z > 1.5 fe.g. lvan Dokkum et al.|[2006l ). 
and is correspondingly important for estimating photo-zs 
as the Balmer break shifts into the NIR bands at these 
redshifts. Table [T] summarizes the photometric data we 
use with EAZY to compute photometric redshifts'^. All 
of these catalogs provide ?7-band photometry necessary 
to break the Lyman-break degeneracy between z ~ 
and z ^ 3, and the catalogs represent most of the deep- 
est public NIR photometry available. The optical pho- 
tometry for 4/5 of the photometric catalogs comes from 
the Hubble Space Telescope (HST). W e use the CDFS- 
GOODS catalog of IWuvts et all (|2008f) which combines 
the deep HST GOODS photometry with ground-based 
UBVRI photometry from the ESO Deep Public Survey 
(|Arnouts et al.ll2001h . The NIR JiJX photometry comes 
from a variety of ground-based facilities. Additionally, 

5 We provide Zphot catalogs for the FIRES, MUSYC, and FIRE- 
WORKS surveys at http://www.astro.yale.edu/eazy/ that su- 
percede the Zphot estimates provided by the catalog references 
hsted in Table[t] 



the majority of the sample is observed in the four IRAC 
bands on the Spitzer Space Telescope. Sources in the 
HDF-N are selected in the /-band (WFPC2-F814W), 
while sources in the other four fields are all selected in 
ii'-band images. 

Though the photometry in the fields listed in Ta- 
ble [1] reaches significantly deeper magnitudes than the 
practical spectroscopic limit, these fields have been ob- 
served with extensive follow-up programs that provide a 
large sample of spectroscopically-measured redshifts. We 
have collected a sample of 1989 spectroscopic redshifts 
from references listed in Table [1] covering the full range 
< Zspec ^ 4. We use only the most reliable redshift 
quality flags when they are available in the spectroscopic 
catalogs. The spectroscopic sample contains 334 galax- 
ies at Zspec > 1-5 that have a variety of spectral types, 
including L yman break galaxie s with blue rest-frame col- 
ors (LBGs: ISteidel et al.|[2003f) and Distant Red Galax- 
ies w i th quite r ed rest-frame colors (DRGs; IFranx et al.l 
[Mil iKriek et al. 2006 ) . 

3.2. Results with default param,eters 

Figure [5] shows that Zphot estimated by EAZY agrees 
remarkably well with Zgpec over the entire redshift range 
covered by the spectroscopic sample. The same code 
parameters are used for all fields, and no additional tem- 
plate or photometric optimizations are done based on 
the Zs-pcc~z-p\iot comparison. We use the normalized me- 
dian absolute deviation (ctnmad) of Az = Zphot — ^spoc 
to quantitatively assess the quality of the photometric 
redshifts, with 



cnmad = 1-48 X median 



Az — niedian(Az) 



-f V Zgpc 



(7) 



With this definition, ctnmad is equal to the standard 
deviation for a Gaussian distribution. An advantage 
of this definition is that it is less sensitive to outliers 
than the usual de finition of the standard deviation (e.g. 
Illbert et al.l 120061 ). Hereafter we drop the subscript for 
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Fig. 6. — Residuals, Az = Zphot ~ z^-pcc^ for different combinations of input parameters. The scatter, a, and tlie number of ^u outliers 
are indicated in each panel with the outliers divided into N > 5cr (superscript) and N < —5cr (subscript), left, top: identical to Figure 
\5\ The "best" parameter set fits all of the PEGASE templates simultaneously ( i|2.2l l and inc l udes the template error function ( i|2.3l l: left, 
bottom: Template error not used; center, top: kcorrect templates from[Blanton fc RoweisI l|2007l ) : center, bottom: Empirical templates 
from lColeman et al.l l|1980l ) and lKinnev et al.l l|1996l) : right, top: Pairs of PEGASE templates, rather than all simultaneously; right, bottom: 
Single PEGASE templates. 



clarity {a — (Tnmad)- The scatter, a, is nearly con- 
stant as a function of Zgpeo with a — 0.034 for the entire 
spectroscopic sample. The scatter does increase above 
^spcc > 1-5 where a = 0.075. Systematic deviations 



from the Zpho 



line are very small at most red- 



shifts, with the exception that Zphot underestimates Zgpoc 
at z = 1.0 — 1.4 by ~5% (median). Figure [5] shows a rela- 
tively small number of sources that have estimated Zphot 
very different from Zgpcc- These "catastrophic outliers" 
(defined here to have A2;/(l + z) > 5a) make up 5% 
of our spectroscopic sample. We note that we have not 
performed any cuts on the spectroscopic sample based 
on signal-to-noise or coincidence with X-ray sources that 
could indicate the presence of an AGN, both of which 
could contribute to a poor estimate of Zphot- In §4.21 we 
describe how the catastrophic outlier fraction can be de- 
creased using observables computed from the Zphot fit. 

3.3. Effects of changing the default parameters 

We have computed Zphot for the spectroscopic sam- 
ple using different combinations of input parameters to 
demonstrate the effects that the features of the code im- 
plementation (fj2]) have on the quality of the computed 
Zphot- In a separate paper, E. N. Taylor et al. (in prep) 
present additional quantitative tests that show not only 
how the EAZY 2:phot estimates depend on changing in- 
put parameters such as the template set, but how the 
science results based on those redshifts — specifically the 
evolution of rest-frame colors and stellar masses of red 
galaxies over < z < 2 — also vary systematically with 
different code inputs. The residuals, Az, for the different 
■Zphot sets are shown in Figure [6l The "best" reference 
parameter set fits the 6 PEGASE NMF templates simul- 
taneously and includes the template error function. Fig- 
ure [5)3 shows the residuals for fits that do not include the 
template error function; the scatter is somewhat higher 
than when the template error is used, and systematic ef- 
fects appear as a function of Zgpoc- Galaxies at z < 1 have 
Zphot overestimated by 10%. The number of catastrophic 



outliers is similar whether or not the template error func- 
tion is used, though when the template error function is 
not used the majority of outliers have Zphot >> ^spoc- 
This is potentially problematic for science applications 
that would be adversely affected by bright, low-z galax- 
ies scattering into high-z samples (e.g. luminosity func- 
tions). The number of 5cr outliers is nearly constant in all 
of the panels of Figure [S] however the number of sources 
with Az/{1 + z) greater than some fixed value, e.g 0.2, is 
significantly lower for the best template/parameter set. 

Figures [6t,d show residuals for Zphot computed us- 
ing the BR07 and empirical CK template sets, respec- 
tively. The differences between fits using the PEGASE 
and BR07 templates are small, though the systematic 
effects are somewhat worse when using the BR07 tem- 
plates: galaxies at Zgpec > 1-5 have Zphot systematically 
low by Az ~ 0.2. The scatter is significantly higher 
when using the empirical templates compared to either 
synthetic template set. The Zphot estimated with the em- 
pirical templates are also systematically underestimated 
at 0.5 < Zgpoc < 1.5. This effect has been observed pre- 
viously in other photometric redshift studies and it has 
often been "cured" by corr ecting the templates b ased on 
the input photometry (e.g. iFeldmann et"al]l2006l ). How- 
ever, we demonstrate here that our carefully-determined 
synthetic template set (Figure [5]) itself greatly reduces 
these systematic effects without requiring any additional 
corrections based on spectroscopic calibration samples. 

Figures [5j3 and [5f show residuals for Zphot for iVtemp = 
2 and 1, respectively (Eq. [2]), given the 6 templates of the 
PI5GASE NMF set. The primary effect of fitting muhi- 
ple templates simultaneously is a striking reduction in 
a. This technique does not allow for the simple spec- 
tral classification provided by single-template fits, how- 
ever the increased precision of the photometric redshifts 
should allow more physical separations of photometric 
samples based on, for example, rest frame colors. 

Figure [7] shows how the incorporation of the redshift 
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Fig. 7. — Effect of the prior. The sources where the prior moved 
the maximum HkeUhood redshift by Uprior — ^ml | > 0.03(l+2spcc) 
are indicated with a large point at ^phot = ^prioj and a tail extend- 
ing to 2phot = ^ML- The eases where the prior improves Zphot a^re 
shown in green, and the cases where the prior incorrectly pushes 
^phot away from Zspoc are shown in red. 



prior affects the Zphot estimates. While the prior gener- 
ally improves ^phot, it does not efhciently discriminate 
between multiple probability peaks in all cases. For 
example, there is a handful of galaxies in the spectro- 
scopic sample at Zgpcc ~ 3 but with Zphot ^ 0.2 resulting 
from the degeneracy between fitting the Balmer break 
at low redshift and the Lyman break at high redshift. 
The prior breaks this degeneracy for the single case indi- 
cated, but does not work for the remaining galaxies with 
discrepant ^phot- The prior does, however, affect more 
sources with similar degeneracies from the full photo- 
metric sample (without Zspoc). Figure [5] indicates that 



there are "clouds" of sources in zml 



space that 



follow the behavior indicated by only one or two sources 



with measured 



'Spec- 



For example, there is a cloud of 



sources with zml ^ 0.2 and Zprior ~ 3. The galaxies 
in this cloud likely have similar colors that all result in 
the same redshift dege neracy of the template fits (see, 
e.g. lOvaizu et al.ll2007l ) . If the single source with a mea- 



sured 



'Spec 



is representative of this group, than Zpi-jor is 



likely closer to the true redshift for all of the sources in 
this group. The opposite could be true for the group 
of sources at zml > 3.5 and ^prior < 1, where the two 



sources with 



'Spec 



indicates that zml is likely a better 



estimate of the true redshift than Zpiior- Nearly all of the 
"clouds" in Figure [S] have one or two counterparts with 
measured -Zspec, so such a figure could be used to choose 
the optimal Zphot estimate for all of the sources in the 
full photometric sample. 

3.4. Comparison to neural network redshifts 

If our assumption that the synthetic photometry of 
the lightcone catalogs is representative of true galaxy 
photometry over < z < 4 f iJ2.2p . then the synthetic 
photometry could perhaps be used to train a neural net- 
work that could estimate photometric redshifts indepen- 
dently of our template-fitting approach. We use the 
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Fig. 8. — Maximum likelihood Zphot vs. Zphot after including 
the prior (Eq. |4] for the full photometric sample of galaxies from 
the surveys listed in Table [T] Sources with measured Zspec and 
where Uprior ~ ^mlI > 0.03(1 -fZspoc) are indicated, as in Figure[7] 
Here, the tails point from [x, y] = [zml, Uprior] to [zapcc, Zspoc]- 
Therefore, horizontally oriented tails indicate cases where the prior 
improves Zphot- Cases where zml 



than . 

are not shown. 



-spec 



but both are different 
which would lie along the one-to-one line on the figure, 



ANNz code (iCollister fc Lahavll2004D to train a commit- 
tee of five 11:10:10:1 neural networks directly on 11-band 
{UBVRIzJHK + IRAC1,2) lightcone catalog. The 
photometric redshifts estimated for the validation sample 
(a random subset of the lightcone catalog) have a ^ 0.03, 
indicating that the network training works reasonably 
well. We test the neural network on the observed pho- 
tometry of the MUSYC HDFS-l field, which contains 
114 spectroscopic redshifts over < z < 3 and whose 
filter transmission curves are identical to those used to 
compute the synthetic lightcone photometry. 

The photometric redshifts estimated by (EAZY, 
ANNz) have a = (0.046,0.100) for the full MUSYC- 
HDFS spectroscopic sample and a = (0.075, 0.105) for 18 
galaxies at Zgpec > 2. Along with the increased scatter, 
the ANNz photometric redshifts show systematic errors 
that are not seen in the EAZY redshifts. These discrep- 
ancies are likely caused by the fact that the neural net- 
work technique depends more critically on the assump- 
tion that the training catalog has identical properties to 
the full data catalog. For example, CoUistcr & Lah^ 
(2004) point out that ANNz is able to account for the 
internal redenning of SDSS galaxies in their training and 
validation samples, but if the dust model of the light- 
cones is incorrect, the neural networks trained on them 
will have systematic problems matching real observa- 
tions. These sorts of problems also affect the template- 
fitting approach, though our additional dusty template 
and the template error function are designed to address 
such systematic uncertainties. 

4. RELIABILITY OF PHOTOMETRIC REDSHIFTS 

4.1. Confidence intervals 
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EAZY 



We can assess how well the confidence intervals com- 
puted by Eq. [7] reflect the ^phot uncertainties by observ- 
ing how often the measured Zgpoc falls within the inter- 
val. For example, 41% of sources have Zgpec outside of 
the 68% confidence interval, while 32% are expected . 
In general, however, Zgpoc lies close to the edge of the 
confidence intervals, and expanding the 68% confidence 
interval by a factor of only 0.01 x (1 -I- Zgpcc) decreases 
the number of discrepant sources to 29%. We find that 
the probability distributions, p{z\C,mo), and therefore 
the confidence intervals computed by Eq. [7] for individ- 
ual sources, are consistent with the Zphot distributions 
of Monte-Carlo simulations in which we measure Zphot 
after perturbing the photometric fluxes within their as- 
sociated uncertainties. In general, we conclude that the 
confidence intervals provide a reasonable representation 
of the uncertainties of the ^phot estimates. 

4.2. Reliability parameter 

There is a small number of sources where Zspoc lies 
well outside even the 99% confidence intervals. These 
sources usually have sharply-peaked probability distri- 
butions, so no alternate definition of Eq. [7] (e.g. 
iFernandez-Soto et al.|[200^ or Monte-Carlo simulations 
could significantly improve the confidence intervals. 
Catastrophic outliers can be caused by (a combination 
of) a number of factors: (1) intrinsic SEDs that are not 
well- reproduced by the template set, (2) degeneracies 
in color-z space that result in multiple peaks in p{z)- 
especially for very blue galaxies with featureless SEDS- 
(3) one or more anomalous photometric measurements, 
or (4) simply that the spectroscopic source was incor- 
rectly matched to a photometric source or that the red- 
shift was misidentified in the spectrum. Quantitative 
features of the Zphot At can be used to identify catas- 
trophic outliers caused by one or more of the p rob- 
lems described above. For example, iBenited (|2000[ 1 de- 
fines a parameter, pAz, that represents the fraction of 
the total integrated probability that lies within ±Az of 
the ZpYiot estimate, and is designed to identify sources 
that hav e broad and/or multi- modal probability distri- 
butions. iMobasher et al.l (|2007f ) find that the Zphot scat- 
ter is an increasing function of a parameter, D95 that 
is defined as the ratio of the 95% confidence interval to 
(l-l-Zphot). Here we define a paramet er, Qz, that is a hy- 
brid of the parameters proposed by IBenited ()2000l ) and 
IMobasher et al.l (|2007D . and also includes the x^ of the 
template fit: 



Q. - 



X 



^99 



^99 



A^filt -^ PAz 



(8) 



The inclusion of x^ should allow us to address the "catas- 
trophic" cases (1-3) above. Figure [9] shows the Zphot 
residuals and cr as a function of Qz ■ We show Qz-cr for 
the spectros copic sample an d also for a simulated sam- 
ple following Bcnite^ (|2000( ). where we fit Zphot for each 
source in the CDFS photometric catalog; set the tem- 
plate colors of the fit at Zphot to be the new photometric 
colors; add photometric scatter following the photomet- 
ric errors; and finally refit Zphot for the new synthetic 
photometry. The Zphot scatter increases sharply above 
Qz = 2 — 3 in both the synthetic and observed spectro- 
scopic samples. The 5 — cr outlier fraction at Qz > 2(3) 
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Fig. 9. — Redshift residuals as a function of the redshift quality 
parameter, Qz (Eq. [S]!. The red line indicates cr^Q^), scaled by a 
factor of 10 so that variations are visible on the plot scale. 

is 0.3 (0.4), SO quality cuts based on Qz can eliminate 
a large fraction of the outliers at the expense of a small 
number of satisfactory Zphot estimates. We have verified 
that quality cuts based on Qz are independent of red- 
shift, that is, cutting on Qz > 3 does not preferentially 
remove only high-z sources. 

4.3. The false security of Zphot ^ -^spcc plots 

We have emphasized that EAZY has been designed to 
estimate photometric redshifts of galaxies in deep photo- 
metric surveys that lack representative calibration sam- 
ples with measured spectroscopic redshifts. Such a sit- 
uation will be the case for the latest generation of large 
NIR surveys that wi ll reach K ~ 25 (e.g. UKIDSS; 
iLawrence et al.l l2007| ) and probe galaxies with L * lu- 
minosities at z ~ 3.5 (e.g. iMarchesini et al.ll2007f) . Al- 
though EAZY (and other codes) performs very well in 



^phot 



plots, many systematic effects are "hidden" 



in such diagrams. This is implicit in Fig. (5) this Fig- 
ure demonstrates that there are large groups of sources 
whose photometric redshifts are very sensitive to the de- 
tails of the optimization routine, but that this behavior 
is only "sampled" by, at-best, one or two sources with 
measured Zgpoc- A single outlier, such as the object at 
(zspec = 4.5, Zphot = 1-25), might not appear notewor- 
thy in a plot like Figure [71 but it could represent a large 
number of objects. 

We explore this effect further by c omparing Zph o t com - 
puted for the full CDFS catalog of iWuvts et all (l20n8h 
and for a perturbed version of the same catalog. For 
the perturbed catalog, we add random zeropoint offsets 
to each of the photometric bands with a maximum off- 
set of 5%, and we remove the J-band from the Zphot 
fit. The Zphot for the normal and perturbed catalogs 
are shown in Figure 1101 The small zeropoint offsets 
cause systematic effects visible as kinks in Figure [101 be- 
tween < Zphot < 1- Without the J-band, the tem- 
plate fits cannot efficiently isolate the Balmer break at 
1.5 < z < 2.5, and the degeneracy of the Zphot fits at 
these redshifts are visible in Figure ITUl Now we consider 
how these systematic effects are traced by the subsample 
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Fig. 10. — Comparison of Zphot computed before and after 
adding random zerop oint error s of up to ~5% to the CDFS photo- 
metric catalog of W uvts et al.l 12008) . The full photometric sam- 
ple is shown in the greyscale contours, while the red points indi- 
cate galaxies with sp ectroscopic redshifts measured by the VVDS 
ILe Fevre et al.ll2005l) . The contour levels are the same as those in 
Figure \8\ 



of galaxies from a flux-limited spectroscopic survey, in 
this case the / < 24 spectroscopic survey of the VVDS 
(|Le Fevre et al.l [20051) . At redshifts where the spectro- 
scopic survey is complete, here z < 1.5, the spectroscopic 
sample effectively traces the systematic effects of the full 
photometric sample and the sources with spectroscopic 
redshifts can be used to calibrate the photometric red- 
shift algorithms. There are a few galaxies at z > 2 in 
the spectroscopic sample, but their Zphot are nearly iden- 
tical from both the normal and perturbed photometry. 
One would not be able to distinguish between the two 
catalogs — or similarly bet ween two separate p hotomet- 
ric redshift algorithms fHil debrandt et al]l2008f l — if each 
produces the same Zphot estimates, even if the Zphot es- 
timates suffer from serious systematic uncertainties for 
sources beyond the flux limits of the spectroscopic com- 
parison sample. 

Similar effects are demonstrated in Fig. [11] Here, we 
did not perturb the zero points, but mimicked the filter 
set of the POWIR survey ^Consehce et al.ll2007i) . which is 
one of the largest -RT-selected surveys to date. Again, the 
spectroscopic sample (in this case, redshifts down to the 
DEEP 2 limit R < 24.1) would suggest reasonably robust 
redshifts for either the full CDFS {a = 0.037) or partial 
POWIR (a = 0.048) filter sets at 1 < z < 2. Consider- 
ing the entire CDFS photometric sample however, it is 
apparent that there are significant discrepancies between 
the two filter combinations at z > 1.25. These degenera- 
cies have important implications for the interpretation of 
high-z galaxy samples, for example surface densities in 
two bins, 1.5 < z < 2 and 2 < z < 2.5, differ by more 
than a factor of 2 depending on the filters used to es- 
timate Zphot- These discrepancies would likely be more 
pronounced in surveys with limited filter coverage that 
are also significantly shallower than the CDFS photom- 
etry. 




1 2 3 

2phoi (all bands) 



Fig. 11. — Comparison of Zphot computed for the full CDFS 
filter set and for a subset of the filters, BRIJK (the filter set used 
in the POWIR survey; Consclicc et al. 2007). CDFS sources with 
measured Zspec and R < 24.1 corresponding to the limit of the 
DEEP2 spectroscopic limit are shown in the large red points. 



5. SUMMARY 

This paper presents a new photometric redshift code, 
dubbed "EAZY". The philosophy of the code is differ- 
ent from other recent photometric redshift codes, in that 
it does not aim to minimize the scatter in Zphot - Zspcc 
comparisons. This type of minimization works well if 
the spectroscopic sample is a random subset of the pho- 
tometric sample, but may lead to erroneous results if the 
spectroscopic sample is biased. In K- or IRAC-selected 
samples the vast majority of objects is very faint in the 
observer's optical (see, e.g. Ivan Dokkum et al.ll2006l ) and 
the subset of galaxies with a spectroscopic redshift is 
highly a- typical. We therefore develop a new template 
set ( ij2.2p based on synthetic photometry of galaxies in a 
semi-analytic model that is essentially complete at red- 
shifts significantly beyond the reach of current spectro- 
scopic surveys. Furthermore, we introduce a "template 
error function" ( §2.3p that accounts for both random 
and systematic differences between observed photome- 
try and the templates and minimizes systematic errors in 
■Zphot without the need to optimize either the templates 
or the photometry based on a spectroscopic calibration 
sample. The template set and template error function 
provided here are intended to be generally applicable to 
NIR-sclccted samples. With these default parameters 
and without further optimization, we find that the scat- 



ter m Zpho 



diagrams (ct = 0.034) is at least as low 



as achieved by other methods and systematic errors are 
minimal over the full range < z < 4 (ij3]). 

The reliability of the uncertainties in the redshifts is 
almost as important as the reliability of the redshifts 
themselves. The uncertainties that our code provides 
behave well for galaxies with a spectroscopic redshift (in 
the sense that the confidence intervals correctly describe 
the deviations from the true redshifts), but we cannot 
test the behavior in the same way for the majority of 
objects without a spectroscopic redshift. Assessing the 
reliability of photometric redshifts for a sample in which 
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the spectroscopic redshifts are not a representative sam- 
ple of the full photometric sample can be misleading (!|4]). 
This is especially true when only limited filter coverage is 
available and large systematic effects can be present that 
are not clearly traced by the spectroscopic sample. In 
practice, we expect that the "Qz" parameter ( §4.2p gives 
a reasonable indication of the robustness of a redshift. 

Progress in the study and interpretation of faint galaxy 
samples is currently limited by the uncertainties in pho- 
tometric redshift estimates. What is sorely needed is a 
complete set of faint, i^-selected galaxies with reliable 
redshifts, so that photometric redshifts can be better 
calibrated. At relatively bright i^-magnitudes progress 
can be expected from near-IR spectroscopy, particularly 
with multi-object near-IR spectrographs. Initial results 
of such programs already point to problems with some 
photometric redshift estimates (jKriek et al.l l2008l ). al- 
though we note that the Kriek et al. sample is well-fit by 



EAZY. Deeper samples may be obtained through differ- 
ent methods. A program is underway with NEWFIRM 
on the Kitt Peak 4m telescope to obtain medium band 
photometry for a complete sample of ~ 10^ galaxies with 
K < 21.5, and this should provide much- needed tests of 
the broad-band results reported in the literature. 



The authors are grateful to Gabriella de Lucia and 
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also thank Stijn Wuyts, Ned Taylor, Rik Williams, and 
Ryan Quadri for extensive tests of early versions of the 
code, and Gregory Rudnick and Marijn Franx for valu- 
able suggestions and comments. 
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